Robust Risk-Aware Reinforcement Learning
نویسندگان
چکیده
We present a reinforcement learning (RL) approach for robust optimization of risk-aware performance criteria. To allow agents to express wide variety risk-reward profiles, we assess the value policy using rank dependent expected utility (RDEU). RDEU allows seek gains, while simultaneously protecting themselves against downside risk. robustify optimal policies model uncertainty, not by its distribution but rather worst possible that lies within Wasserstein ball around it. Thus, our problem formulation may be viewed as an actor/agent choosing (the outer problem) and adversary then acting worsen strategy inner problem). develop explicit gradient formulae problems show their efficacy on three prototypical financial problems: portfolio allocation, benchmark optimization, statistical arbitrage.
منابع مشابه
Robust Adversarial Reinforcement Learning
Deep neural networks coupled with fast simulation and improved computation have led to recent successes in the field of reinforcement learning (RL). However, most current RL-based approaches fail to generalize since: (a) the gap between simulation and real world is so large that policy-learning approaches fail to transfer; (b) even if policy learning is done in real world, the data scarcity lea...
متن کاملRobust Reinforcement Learning inMotion
While exploring to nd better solutions, an agent performing on-line reinforcement learning (RL) can perform worse than is acceptable. In some cases, exploration might have unsafe, or even catastrophic , results, often modeled in terms of reaching`failure' states of the agent's environment. This paper presents a method that uses domain knowledge to reduce the number of failures during exploratio...
متن کاملRobust Reinforcement Learning
This letter proposes a new reinforcement learning (RL) paradigm that explicitly takes into account input disturbance as well as modeling errors. The use of environmental models in RL is quite popular for both offline learning using simulations and for online action planning. However, the difference between the model and the real environment can lead to unpredictable, and often unwanted, results...
متن کاملDistral: Robust multitask reinforcement learning
Most deep reinforcement learning algorithms are data inefficient in complex and rich environments, limiting their applicability to many scenarios. One direction for improving data efficiency is multitask learning with shared neural network parameters, where efficiency may be improved through transfer across related tasks. In practice, however, this is not usually observed, because gradients fro...
متن کاملUncertainty-Aware Reinforcement Learning for Collision Avoidance
Reinforcement learning can enable complex, adaptive behavior to be learned automatically for autonomous robotic platforms. However, practical deployment of reinforcement learning methods must contend with the fact that the training process itself can be unsafe for the robot. In this paper, we consider the specific case of a mobile robot learning to navigate an a priori unknown environment while...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Siam Journal on Financial Mathematics
سال: 2022
ISSN: ['1945-497X']
DOI: https://doi.org/10.1137/21m144640x